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DECOUPLING TCP/IP PROCESSING IN SYSTEM AREA NETWORKS 



The invention relates to decoupling Transmission 
Control Protocol / Internet Protocol (TCP/IP) processing in 
system area networks (SANs) . 

SANs provide computer network clients with access to 
computer applications and services stored on application 
nodes. Network clients typically utilize Transmission 
Control Protocol/Internet Protocol (TCP/IP) to communicate 
with application nodes. Application node operating systems 
have been responsible for processing TCP/IP packets. 
TCP/IP processing demand at application node resources can 
slow down the application processing speed. 



FIG. 1 illustrates a system area network. 
FIG. 2 illustrates a proxy node according to the 
invention . 

FIG. 3a - 3g are flowcharts according to the 
invention . 

FIG. 4 is a flowchart according to the invention. 
FIG. 5 is a timeline of communications. 
FIG. 6 is a state diagram. 
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DETAILED DESCRIPTION 
FIG. 1 illustrates a computer system 10 including 
network clients 12, a system area network (SAN) 14 and a 
SAN management node 22. The network clients 12 can be 
configured, for example, to access services provided by 
application nodes 20a, 20b, 20c ... 20k through either a 
local area network (LAN) or a wide area network (WAN) . The 
SAN 14 has one or more network nodes 16a ... 16k, one or more 
proxy nodes 18a ... 18k, and one or more application nodes 
20a, 20b, 20c ... 20k. Each node includes at least one 
processor, a memory unit, and at least one network 
connection port. 

The network nodes 16a ... 16k are platforms that provide 
an interface between the network clients 12 and the SAN 14. 
The network nodes 16a ... 16k may be configured to perform 
load balancing across multiple proxy nodes 18a ... 18k. 

The proxy nodes 18a ... 18k are platforms that can 
provide various network services including network firewall 
functions, caching functions, network security functions, 
and load balancing. The proxy nodes 18a ... 18k also perform 
TCP/IP processing on behalf of the application nodes 20a, 
20b, 20c ... 20k. The proxy node 18a may, for example, 
include a computer configured to accomplish the tasks 
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described below. The application nodes 20a, 20b, 20c ... 20k 
are platforms that function as hosts to various 
applications, such as a web service, mail service, or 
directory service . 

SAN channels 24 interconnect the various nodes. SAN 
channels *24 may be configured to connect a single network 
node 16a ... 16k to multiple proxy nodes 18a ... 18k, to 
connect a single proxy node 18a ... 18k to multiple network 
nodes 16a ... 16k and to multiple application nodes 20a, 20b, 
20c ... 20k, and to connect a single application node 20a, 
20b, 20c ... 20k to multiple proxy nodes 18a ... 18k. 

In FIG . 1, the network clients 12 utilize TCP/IP to 
communicate with the network nodes 16a ... 16k and proxy 
nodes 18a ... 18k. A TCP/IP packet enters the SAN 14 at a 

network node 16a and travels through a SAN channel 24 to a 

j 

proxy node 18a. The proxy node 18a processes and 
translates the TCP/IP packet using a lightweight protocol. 

The term "lightweight protocol" refers to a protocol 
that has low operating system resource overhead 
requirements. Examples of lightweight protocols include 
Winsock-DP Protocol and Credit Request /Response Protocol. 
If required, one or more lightweight protocol messages 
travel through another SAN channel 24 to an application 
node 20a. 



3 



Attorney Do< 




No.: 10559/370001/P10176 



Packets also can flow in the opposite direction, 
starting, for example, at the application node 20a as a 
lightweight protocol message. The lightweight protocol 
message travels through a SAN channel 24 to the proxy node 
18a. The proxy node 18a processes the lightweight protocol 
message and if necessary, translates the message into one 
or more TCP/IP packets. The TCP/IP packets then travel 
from the proxy node 18a to a network node 16a through a SAN 
channel 24. The TCP/IP packets exit the SAN 14 through the 
network node 16a and are received by the network clients 



A single TCP/IP packet may be translated into one or 
more lightweight protocol messages, a single lightweight 
protocol message may be translated into one or more TCP/IP 
packets, multiple TCP/IP packets may be translated into a 
single lightweight protocol message, or multiple 
lightweight protocol messages may be translated into a 
single TCP/IP packet. A single TCP/IP packet may not 
generate any lightweight protocol messages, and a single 
lightweight protocol message may not generate any TCP/IP 
packets . 

As shown in FIG. 2 each proxy node, such as the proxy 
node 18a, incorporates the functions of TCP/IP processing, 
protocol translating and lightweight protocol processing. 
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The proxy node 18a includes two interfaces 34, 30 for 
connection to other SAN 14 components. A TCP/IP packet 
enters or exits the proxy node 18a at the interface 34. A 
TCP/IP processing module 32 accomplishes TCP/IP processing. 
A protocol translation engine 26 translates the TCP/IP 
packets and communicates the data using a lightweight 
protocol. A lightweight protocol processing module 28 
accomplishes lightweight protocol processing. Lightweight 
protocol messages exit or enter the proxy node 18a at the 



U 

*fi 10 interface 30. 

i 2 



Proxy nodes 18a ... 18k also create or destroy SAN 
channels 24 between the network nodes 16a ... 16k and the 
applications nodes 20a, 20b, 20c ... 20k. Proxy nodes 18a ... 
18k also create or destroy TCP endpoints. Proxy nodes also 



□ 

m 

ftj 15 relay TCP/IP byte streams arriving from network clients 12 



through network nodes 16a ... 16k, and addressed to 
application nodes 20a, 20b, 20c ... 20k using a lightweight 
protocol. Proxy nodes 18a ... 18k also communicate 
lightweight protocol data arriving from application nodes 
20 20a, 20b, 20c ... 20k to the network clients 12 through 
network nodes 16a ... 16k using TCP/IP. 

The proxy node 18a also may incorporate other standard 
proxy node services 36 including caching, firewall, and 
load balancing. 
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Proxy nodes 18a ... 18k communicate with network nodes 

16a ... 16k and application nodes 20a ... 20k using SAN 

channels 24. SAN channels 24 may be established at the time 

of service startup (i.e. the initial offering of an 

5 application's services on a proxy node 18a ... 18k). SAN 

channels 24 may be destroyed, for example during a service 

shutdown, for SAN 14 resource management reasons, or due to 

a catastrophic error occurring in a SAN 14. 

Referring to FIG. 3a - 3g and FIG. 4, proxy nodes 18a 

\Q 10 ... 18k perform protocol processing on behalf of application 
Sj 

IJl nodes 20a, 20b, 20c ... 20k. 

IS 

A proxy node 18a may receive 50, for example, a 
JOIN__SERVICE message 52 from an application node 20a, 20b, 



n 20c ... 20k. Proxy nodes 18a ... 18k maintain lists of 

m 

IU 15 application nodes that may be accessed through the proxy 

h ■ 

}^ node 18a ... 18k. The JOIN_SERVICE message 52 indicates, 

that a particular application node 20a should be included 
on the list of application nodes offered by the proxy node 
18a and be available for accessing by a network client 12. 
20 Upon receiving the JOIN_SERVICE message, the proxy node 18a 
determines 54 whether a corresponding TCP endpoint already 
exists on the proxy node 18a for that application service. 
If a corresponding TCP endpoint exists, the proxy node 18a 
adds 56 the application node 20a to the list of application 
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nodes associated with that service and ends the process 
278. If a corresponding TCP endpoint does not exist, the 
proxy node 18a creates 58 a corresponding TCP endpoint 
(with associated IP address and TCP port number) and sets 
5 60 the TCP endpoint in a TCP LISTEN state. The proxy node 
18a adds 56 the application node 20a to the list of 
application nodes associated with that service and then 
ends 27 8 the process . 

Various TCP states are available. A CLOSED TCP state 
10 indicates that a TCP endpoint is closed. A LISTEN TCP 
state indicates that the TCP endpoint is listening. A 
SYNJSENT TCP state indicates that a SYN packet has been 
t?=5 sent on the TCP endpoint. A SYN_RCVD state indicates that 

i; 

yf a SYN signal has been sent, a response has been received on 

m 

IU 15 a TCP endpoint, and receipt of an acknowledgement (ACK) 
Q 

M signal is pending. An ESTABLISHED state indicates that a 

connection has been established and that data is being 
transferred. A CLOSE_WAIT state indicates that a finish 
(FIN) signal has been received and an application is 
20 closing. A FIN_WAIT_1 state indicates that the endpoint 
has closed, a FIN signal has been sent to an application, 
and receipt of an ACK and FIN is pending. A CLOSING state 
indicates that an application is closing and the TCP 
endpoint is awaiting receipt of an ACK. A LAST ACK state 
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indicates that a FIN TCP packet has been received, an 
application has closed, and the TCP endpoint is awaiting 
receipt of an ACK. A FIN_WAIT_2 state indicates that an 
application has closed and the TCP endpoint is awaiting 
receipt of a FIN signal. A TIME_WAIT - 2MSL (maximum 
segment lifetime) state is a wait state for a TCP -endpoint 
after actively closing. 

A proxy node 18a ... 18k, for example proxy node 18a, 
may receive a LEAVE_SERVICE message 62 from an application 
node 20a, 20b, 20c ... 20k, for example application node 20a. 
A LEAVERS ERVICE message 62 indicates that a particular 
application node 20a should be removed from the list of 
application nodes 20a, 20b ... 20k that may be accessed 
through a particular proxy node 18a. Upon receiving a 
L E A VE_ SERVICE message 62, the proxy node 18a removes 70 the 
application node 20a from the list of accessible 
application nodes. The proxy node 18a then determines 64 
whether the list of application nodes available through the 
proxy node 18a is empty. If the list is empty, the proxy 
node 18a closes 66 the corresponding TCP endpoint and 
cleans 68 any resources associated with the corresponding 
TCP endpoint. The proxy node 18a then ends 278 the 
process . 
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A proxy node 18a ... 18k, for example proxy node 18a, 
may receive a CONNECT I ON_REQUEST message 72 from an 
application node 20a, 20b ... 20k, for example application 
node 20a. A CONNECT I 0N_REQUE ST message indicates that an 
5 application node 20a wants to create a connection with a 
network client 12. The proxy node 18a creates 74 a 
corresponding TCP endpoint, the proxy node 18a then 
actively opens 76 the TCP endpoint, and sets 78 the TCP 
endpoint to a TCP SYN_SENT state. The proxy node 18a then 

0 

iO 10 sends 80 a TCP SYN packet to the corresponding network 
fij client 12, through a network node 16a ... 16k. 

i . s 

;^ A proxy node 18a ... 18k, for example proxy node 18a, 

may receive an ACCEPT_CONNECTION message 82 from an 
application node 20a, 20b ... 20k, for example application 
jy 15 node 20a. An ACCEPT_CONNECTION message 82 is sent by an 
application node 20a to accept a connection request. The 
proxy node 18a updates 84 the TCP endpoint' s connection 
information and directs 86 subsequent TCP flow to that 
application node 20a. The proxy node 18a then ends the 
20 process. 

A proxy node 18a ... 18k, for example proxy node 18a, 
may receive a RE JECT_CONNECTION message 88 from an 
application node 20a, 20b ... 20k, for example application 
node 20a. The proxy node 18a closes 90 the corresponding 
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TCP connection and sends 92 a TCP RST packet to a network 
client 12 through a network node 16a ... 16k. The proxy node 
18a then ends 278 the process. 

A proxy node 18a ... 18k, for example proxy node 18a, 
may receive DATA 94 from an application node 20a ... 20k, for 
example application node 20a. The proxy node 18a stores 96 
the DATA on a send queue of the corresponding TCP endpoint. 
The proxy node then determines 302 whether a TCP/IP packet 
can be sent on the TCP endpoint. If not, the proxy node 
18a ends 278 the process. If a TCP/IP packet can be sent, 
the proxy node 18a computes 304 the size of the packet that 
can be sent. The proxy node 18a constructs 306 the TCP/IP 
packet header and sets 308 the TCP flags in the TCP/IP 
packet according to the TCP state. The proxy node 18a then 
determines 310 if the data can be sent in the packet. If 
it can be sent, the proxy node 18a extracts 312 the data 
from the transmission control block (TCB) send queue and 
sends 313 the TCP/IP packet to a network client 12 through 
a network node 16a. A TCB is used by TCP/IP to store 
various information related to a particular TCP endpoint. 
TCBs typically contain information such as foreign and 
local IP addresses, foreign and local port numbers, options 
for endpoints, state information for each TCP endpoint, 
sequence numbers, window sizes, retransmission timers, etc. 
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A proxy node 18a ... 18k, for example proxy node 18a, 
may receive a CLOSE_CONNECTION message 102 from an 
application node 20a, 20b ... 20k, for example application 
node 20a. The proxy node determines 107 whether the TCP 
endpoint is in the LISTEN or SYN_SENT state. If the TCP 
endpoint is in either of these states, the proxy node 18a 
closes 114 the TCP endpoint and ends 278 the process. If 
the TCP endpoint is not in one of these states, the proxy 
node 18a determines 109 whether the TCP endpoint is in the 
SYN_RCVD, ESTABLISHED, or CLOSE_WAIT state. If it is, the 
proxy node then determines 103 whether there is 
unacknowledged data on the TCB send queue. If there is no 
unacknowledged data, the proxy node 18a determines 104 
whether the TCP endpoint is in the SYN_RECVD or ESTABLISHED 
state. If it is, the proxy node 18a changes 106 the TCP 
state to FIN_WAIT_1. If the TCP endpoint is in the 
CLOSE_WAIT state, the proxy node 18a changes 110 the state 
of the TCP endpoint to LAST_ACK . The proxy node 18a then 
sends 112 a TCP FIN packet to the corresponding network 
client 12, through a network node 16a ... 16k, and eventually 
closes the TCP/IP connection. If it is determined (in 
block 103) that there is unacknowledged data on the TCB 
send queue, the proxy node 18a marks 105 the 
CLOSE_CONNECTION_RECEIVED flag for the TCP endpoint and 
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proceeds to determine whether a TCP /IP packet can be sent 
on the TCP endpoint (block 302) . 

If a TCP endpoint needs to be shutdown on a particular 
proxy node 18a, the proxy node sends a SHUTDOWN_SERVICE 
message to the application node associated with that TCP 
endpoint. When the application node 20a receives a 
SHUTDOWN_SERVICE message from the proxy node 18a, the 
application node performs appropriate service cleanup, 
including shutting down the application in case any proxy 
service is unavailable. 

Proxy nodes 18a ... 18k process TCP/IP packets received 
from network clients 12 through network nodes 16a ... 16k. 

A proxy node 18a may receive 150 a TCP/IP packet from 
a network client 12 through a network node 16a. The proxy 
node 18a typically performs 152 a TCP checksum process. If 
the TCP checksum process result is unacceptable, the proxy 
node 18a drops 280 the packet and ends 278 the process. 

If the result of the checksum process is acceptable, 
the proxy node attempts to identify 154 the TCP connection 
associated with the packet. If a TCP endpoint corresponding 
to TCP connection associated with the packet is identified, 
then the proxy node 18a continues with block 156. If no 
TCP connection is identified, the proxy node 18a determines 
140 whether the SYN flag is set on the packet. If the SYN 
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flag is not set, the proxy node 18a resets 144 the 
connection and ends 278 the process. If the SYN flag is 
set, the proxy node 18a determines 142 whether the 
corresponding TCP endpoint is in the LISTEN state. If it is 
5 not, the proxy node 18a resets 144 the connection and ends 
278 the process. If the corresponding TCP endpoint is in 
the LISTEN state, the proxy node 18a creates 146 a new TCP 
endpoint. The proxy node then duplicates the TCB 
information and continues the process in block 156. 
=Q 10 If a TCP connection is identified, the proxy node 18a 

IP determines 156 whether a reset (RST) flag on the TCP packet 

m 

j*{ is set. 

In one implementation the TCP header contains six flag 

Q 

bits, although in other cases there may be fewer or more 

jlj 

iy 15 flag bits in the TCP header. An URG flag bit indicates 

Q 

whether a TCP packet is urgent. An ACK flag bit indicates 
whether the TCP packet acknowledgement number is valid. A 
PSH flag bit indicates whether a proxy node 18a ... 18k 
should pass the packet data to an application node 20a, 
20 20b, 20c ... 20k as soon as possible. A RST flag bit 

indicates whether the TCP endpoint should be reset. A SYN ' 
flag bit indicates whether sequence numbers should be 
synchronized to initiate a connection. A FIN flag bit 
indicates whether the sender is finished sending data. 
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If the RST flag is set, the proxy node 18a resets 158 
the TCP/IP endpoint. The proxy node 18a then determines 
160 whether a CLOSE_CONNECTION message should be sent to 
the application node 20a. If such a message is needed, the 
5 proxy node 18a sends 162 a CLOSE_CONNECTION message to the 
application node 20a. The proxy node 18a then ends 278 the 
process. If a CLOSE_CONNECTION message is not required, 
the proxy node 18a simply ends 278 the process. If a RST 
flag is not set in the TCP/IP packet, the proxy node 18a 
0 10 determines 157 whether the TCP endpoint is in a LISTEN 

state. If not, the proxy node 18a processes 159 the TCP 
options . 

The proxy node 18a also determines 164 whether a SYN 



m 
m 
w 



flag is set in the TCP packet. If the proxy node 18a 

iu 

fl| 15 determines 164 that the SYN flag is set, the proxy node 18a 

5 

U determines 166 whether the TCP endpoint is in a LISTEN 

state. If the TCP endpoint is in the LISTEN state, the 
proxy node 18a initializes 168 the associated TCB. 

After initializing 168 the associated TCB, the proxy 
20 node 18a sends 170 a TCP SYN+ACK packet to the network 

client 12 and ends 278 the process. If, on the other hand, 
the TCP endpoint is not in the LISTEN state, the proxy node 
18a determines 167 whether the TCP endpoint is in the 
SYN_SENT state. If the TCP endpoint is not in the SYN SENT 
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state, the proxy node 18a sends a RST to a network client 
12 through the network node 16a. If the TCP endpoint is in 
the SYN_SENT state, the proxy node 18a determines 172 
whether an ACK flag is set in the TCP packet. If the ACK 
flag is set, then the proxy node 18a changes 174 the TCP 
endpoint state to ESTABLISHED. The proxy node 18a then 
sends 176 a TCP ACK packet to the network client 12. The 
proxy node 18a identifies 178 the application node 20a 
corresponding to the TCP endpoint, and determines 180 
whether delayed binding is required. If delayed binding is 
not required, the proxy node 18a determines 181 if an 
ACCEPT_CONNECTION message is required. If an 
ACCEPT_CONNECTION message is required, the proxy node 18a 
sends 182 an ACCEPT_CONNECTION message to the application 
node 20a. The proxy node 18a determines 185 if FIN or DATA 
is in the packet. If they are not, the proxy node 18a ends 
278 the process. If either FIN or DATA is in the packet, 
the process continues with block 184. If the TCP endpoint 
is in the SYN_SENT state and the ACK flag is not set in the 
TCP packet, the proxy node sets 284 the TCP endpoint state 
to SYN_RCVD and determines 18 5 whether FIN or DATA is 
included in the packet. 

The proxy node 18a determines 184 whether a received 
TCP packet received is the next TCP packet expected on a 
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particular TCP endpoint. If it is not, then the proxy node 
18a places 186 the TCP packet on a re-sequencing queue, and 
strips off 188 SYN/DATA/FIN . If the proxy node 18a 
determines that the packet received is the next packet 
expected, the proxy node 18a trims 185 any packet data not 
within the window, and determines 190 whether the TCP ACK 
flag is set in the packet. If the ACK flag is set, the 
proxy node 18a determines 189 if it is a duplicate ACK. If 
it is a duplicate ACK the proxy node 18a performs 191 a 
fast recovery algorithm. The proxy node then updates 192 
the TCB information and removes 193 ACKED data from the TCB 
send queue. The proxy node 18a then determines 194 whether 
the TCP endpoint is in the SYN_RCVD state. If the TCP 
endpoint is in the SYN_RCVD state, the proxy node 
determines 195 whether there is an ACK number error. If 
there is not an ACK number error, the proxy node 18a 
changes 196' the TCP endpoint state to ESTABLISHED. If 
there is an ACK number error, the proxy node 18a resets 199 
the connection and sends a RST to the network client 12 
through the network nodes 16a ... 16k. The proxy node 18a 
then ends 278 the process. 

After changing the TCP/IP endpoint state to 
ESTABLISHED, the proxy node 18a identifies 198 the 
application node 20a that corresponds to the TCP endpoint 
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and determines 200 whether delayed binding is required. If 
delayed binding is not required, the proxy node 18a 
determines 197 whether the corresponding TCP endpoint was 
opened passively. If the TCP endpoint was opened passively, 
the proxy node 18a determines 201 whether a 
CONNECT I ON_REQUE ST message is required. If a 
CONNECT I ON_REQUE ST message is required, the proxy node 18a 
sends a CONNECT ION_REQUEST message to the application node 
20a, if needed. The process continues with block 227. If 
the TCP endpoint was not opened passively, then the proxy 
node 18a determines 203 whether an ACCEPT_CONNECTION 
message is required. If an ACCEPT_CONNECTION message is 
required, then the proxy node 18a sends 205 an 
ACCEPT_CONNECTION message to the application node 20a. The 
process then continues with block 227. 

The proxy node 18a determines 204 whether the TCP 
endpoint is in the CLOSE_WAIT or ESTABLISHED state. If the 
TCP endpoint is in CLOSE_WAIT or ESTABLISHED state, the 
proxy node 18a determines 208 whether there is any 
unacknowledged data on the TCB send queue. The proxy node 
18a then determines 210 whether a CLOSE CONNECTION message 
already has been received from the application node 20a. 
The proxy node 18a sends 212 a FIN to the network client 12 
through the network node 16a and determines 213 whether the 
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TCP endpoint is in the ESTABLISHED state. If the TCP 
endpoint is in the ESTABLISHED state, the proxy node 18a 
changes 214 the TCP endpoint state to FIN WAIT 1. The 
process continues in with block 227. If the TCP endpoint is 
in the CLOSE_WAIT state, the proxy node 18a changes 215 the 
TCP endpoint state to LAST_ACK. The process continues with 
block 268 in which the proxy node 18a scans 268 the re- 
sequencing queue. 

The proxy node 18a may determine 216 that the TCP 
endpoint is in the FIN_WAIT_1 state. If the TCP endpoint 
is in the FIN_WAIT_1 state, the proxy node 18a determines 
217 whether the FIN is acknowledged (ACKED) . If it is not, 
the process continues with block 227. If the FIN is ACKED, 
the proxy node 18a changes 218 the TCP endpoint state to 
FIN_WAIT_2. The process then continues with block 227. 

The proxy node 18a determines 220 that the TCP 
endpoint is in the CLOSING state. If the TCP endpoint is 
in the CLOSING state, the proxy node 18a determines 219 
whether the FIN is ACKED. If it is not, the process 
continues with block 227. If the FIN is ACKED, the proxy 
node 18a changes 222 the TCP endpoint state to TIME_WAIT. 
The process then continues with block 227. 

The proxy node 18a may determine 224 that the TCP 
endpoint is in the LAST_ACK state. If the TCP endpoint is 
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in the LAST ACK state, the proxy node 18a determines 225 
whether the FIN is ACKED. If not, the process continues 
with block 227. If the FIN is ACKED, the proxy node closes 
226 the connection and cleans up the TCP endpoint. The 
proxy node 18a then ends 287 the process. 

As indicated by block 227, the proxy node 18a may 
process any TCP urgent data. The proxy node 18a then 
determines 229 if there is any data in the packet. If 
there is not, the proxy node proceeds to block 246. If 
there is data in the packet, the proxy node 18a processes 
228 the TCP/IP packet data. The proxy node 18a determines 
230 whether the corresponding TCP endpoint is in any of the 
TCP states SYN_RCVD, ESTABLISHED, FIN_WAIT_1 or FIN_WAIT_2 . 
The data is only accepted 234 in these states. Otherwise, 
the data is rejected 232. If the data is accepted 234, the 
proxy node 18a places 236 the data on the receive queue of 
the TCP endpoint. The proxy node 18a then strips 238 off 
the TCP/IP header. 

The proxy node 18a determines 282 if a data packet is 
the first data packet received on a particular TCP/IP 
endpoint. The proxy node 18a then determines 245 whether 
delayed binding is required. If it is not, the proxy node 
18a communicates 240 the data to an application node 20a. 
The proxy node 18a then determines 242 if the data had been 
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successfully communicated to the application node 20a. If 
the data had been successfully communicated, the proxy node 
18a removes 244 the data from the receive queue of the TCP 
endpoint . 

If the packet received is not the first packet 
received on the TCP/IP endpoint, the proxy node 18a 
communicates 241 the available data on the TCB receive 
queue to the application node. The proxy node 18a then 
removes 243 successfully communicated data from the TCB 
receive queue. The process continues with block 246. 

If delayed binding is required, the proxy node 
determines 247 whether the TCP/IP endpoint was opened 
passively. If it was opened passively, the proxy node 18a 
determines 233 whether a CONNECT ION_REQUEST is required. If 
a CONNECT I ON_REQUE ST is required, the proxy node 18a sends 
249 a CONNECTION REQUEST message to the application node 
20a. If the endpoint was not opened passively, the proxy 
node 18a determines 235 whether an ACCEPT_CONNECTION is 
required. If an ACCEPT_CONNECTION is required, the proxy 
node 18a sends 251 an ACCEPT CONNECTION message to the 
application node 20a. The process continues with block 240 
as described above. 

The proxy node 18a may determine 24 6 that a FIN flag 
is set in a particular TCP packet. If a FIN flag is set, 
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the proxy node 18a determines 248 if the TCP endpoint is in 
the SYN_RCVD or ESTABLISHED state. If the TCP endpoint is 
in either of these states, the proxy node 18a changes 250 
the TCP endpoint state to CLOSEJtfAIT. The proxy node 
determines 252 if a CLOSEjSONNECTION message should be sent 
to the application node 20a. If so, the proxy node 18a 
sends 254 the CLOSE_CONNECTION message to the application 
node 20a. 

The proxy node may determine 256 that the TCP endpoint 
is in FIN_WAIT_1 state. If the TCP endpoint is in the 
FIN_WAIT_1 state, the proxy node 18a determines 258 whether 
any unacknowledged data exists on the corresponding TCB 
send queue. If the TCP endpoint is in FIN_WAIT_1 state and 
there is no unacknowledged data, the proxy node 18a changes 
260 the TCP endpoint state to TIMEJtfAIT. Otherwise, the 
proxy node changes 262 the TCP endpoint state to CLOSING. 

The proxy node 18a may determine 264 that a TCP 
endpoint is in FIN_WAIT_2 state. If the TCP endpoint is in 
FIN_WAIT_2 state, then the proxy node 18a changes 266 the 
TCP endpoint state to TIME_WAIT . 

The proxy node 18a scans 268 the re-sequencing queue 
of the TCP endpoint and determines 269 whether the 
resequencing queue is empty. If it is not empty, the proxy 
node 18a de-queues 271 the next packet from the 
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desequencing queue and determines 273 whether the packet is 
obsolete 273. If it is obsolete, the proxy node 18a frees 
275 the packet and returns to block 275. If the proxy node 
18a determines that the resequencing queue is empty, the 
process continues with block 302 as described above. 

The proxy node 18a communicates data to the 
application node 20a using a lightweight protocol. The 
proxy node 18a determines 316 whether the TCB receive queue 
is empty. If the TCB receive queue is empty, the proxy node 
18a ends 278 the process. If the TCB receive queue is not 
empty, then the proxy node 18a determines 318 whether data 
on the receive queue can be communicated to the application 
node 20a. If the data cannot be communicated, the proxy 
node 18a ends 278 the process. If the data can be 
communicated, the proxy node 18a communicates 241 the 
available data on the TCB receive queue to the application 
node 20a. The proxy node 18a then removes communicated data 
from the TCB receive queue. The proxy node 18a may perform 
processes described above periodically as a part of timer 
functions . 

FIG. 5 shows a set of exemplary communications between 
a network client 12, a network node 16a, a proxy node 18a 
and an application node 20a. Time is shown along the 
vertical axis. The end components included within the 
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region identified by 320 communicate utilizing TCP/IP 
protocol. The end components included within the region 
identified by 330 communicate utilizing a lightweight 
protocol . 

The timeline in FIG. 5 begins with a network client 12 
issuing a SYN TCP/IP packet 380. The SYN packet 380 is a 
request from the network client 12 addressed to an 
application service on an application node 20a to establish 
a connection with a particular application service. This 
packet is passed through the network node 16a and is 
intercepted by the proxy node 18a. Since no information is 
required from the application node 20a, the proxy node 18a 
processes the TCP/IP SYN packet 380 and responds to the 
network client 12 with a SYN+ACK TCP/IP packet 382 that is 
sent through the network node 16a and that indicates that 
the SYN request 380 has been acknowledged. Each connection 
request is acknowledged before a connection can take place. 
The network client 12 receives the SYN+ACK signal 382 and 
responds with an ACK packet 384. The connection then is 
established between the network client 12 and the proxy 
node 18a. 

The network client 12 begins transmitting data 386 to 
the proxy node 18a through the network node 16a over the 
established connection. At that point, the proxy node 18a 
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realizes that involvement from the application node 20a is 
needed. Therefore, the proxy node 18a sends a CONNECTION 
REQUEST message 390 with or without the translated data to 
the application node 20a using a lightweight protocol. The 
application node 20a responds by issuing an ACCEPT 
CONNECTION lightweight protocol message 392 with any 
necessary data. The proxy node 18a processes and 
translates the message 392 ... 398 and communicates the 
translated data 394 ... 402 to the network client 12 through 
the network node 16a using TCP/IP . 

When the network client 12 receives data it sends 
acknowledgement ACK packets 404 ... 410 back to the proxy 
node 18a. When the application node 20a is finished 
sending all requested data, it sends a CLOSE CONNECTION 
message 406 to the proxy node 18a. Multiple lightweight 
protocol messages may be combined with each other. The 
proxy node 18a then sends a FIN TCP/IP packet 408 through 
the network node 16a to the network client 12 indicating 
that the connection should be terminated. The network 
client 12 acknowledges receipt of the signal 408 by sending 
a FIN+ACK TCP/IP packet 412 through the network node 16a to 
the proxy node 18a. The proxy node then sends an ACK 
packet 414 to the network client 12. 



24 



Attorney Do^^No.: 1 05 59/37000 1 /P 1 0 1 76 

FIG. 6 illustrates state transition diagrams for an 
IngressPool 504 buffer and an EgressPool 506 buffer. Proxy 
nodes, for example proxy node 18a, may be optimized for 
TCP/IP-to-lightweight protocol translation by using 
IngressPool 504 buffers and EgressPool 506 buffers to 
perform zero copy translation of data. The proxy node 18a 
may receive 500 data (from network clients 12 through 
network nodes 16a ... 16k) and may receive 502 data (from 
application nodes 20a ... 20k) simultaneously. The protocol 
translator 26 can be configured to perform zero-copy 
translation of data in a proxy node 18a. Other techniques 
also can be utilized to enable zero-copy translation of 
data. Proxy nodes 18a ... 18k may maintain two pools of 
registered buffers. IngressPool 504 buffers may store 
incoming data and EgressPool 506 buffers may store outgoing 
data. Both of these pools may be registered with each 
interface 30, 34 in the proxy nodes 18a ... 18k. The 
recommended size of the IngressPool 504 buffer may 
typically be determined by considering the maximum number 
of concurrent TCP connections to network clients 12 through 
network nodes 16a ... 16k supported, the maximum size of 
receive windows advertised to the network client 12, and 
the maximum number of outstanding receive descriptors on 
the SAN channels 24. The recommended size of the EgressPool 



25 



if! 

139 
W 



Attorney Do^^Slo.: 10559/370001 /PI 01 76 

506 buffer may typically be determined by considering the 

maximum number SAN channels 24 used for communication with 

application nodes 20a, 20b, 20c ... 20k, the maximum amount 

of credits available per SAN channel 24, the maximum number 

5 of concurrent TCP connections supported, and the maximum 

size of receive windows advertised to the network clients. 

Each buffer may be described by a memory buffer that tracks 

the memory handle, offset within a buffer, and the length 

of data. Each buffer can exist in one of several main 

10 states: posted (508 & 512), received (500 & 502), and freed 

(510 & 514). By maintaining the memory buffer structure 

and states of the buffer, the proxy node 18a may achieve 

zero-copy translation. 

r? Any higher layer service that executes policies above 

III 

j{§ 15 the transport layer can be built on top of the decoupling 

3 

M: technique on a proxy node 18a. For example, a layer 5 (L5) 

web switch that maintains Hyper Text Transfer Protocol 
(HTTP) 1.0, HTTP-S (Secure) 1.0, HTTP 1.1, HTTP-S 1.1 
connections with the network clients and HTTP 1.1 
20 connections with the application nodes 20a, 20b, 20c ... 20k 
can be built on top of a proxy node 18a. Additionally, 
HTTP 1.0, HTTP-S 1.0, HTTP 1.1, HTTP-S 1.1 data exchanged 
with network clients 12 may use TCP/IP; and HTTP 1.1 data 
exchanged between proxy nodes 18a ... 18k and application 
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nodes 20a, 20b ... 20k can use lightweight protocol . Each 
HTTP transaction from a network client 12 can be mapped 
onto a HTTP 1.1 transaction to an application node 20a, 20b 



In addition to the foregoing techniques, the proxy 
node 18a can be configured to perform other processing 
related to TCP/IP including, for example , timers , 
algorithms such as congestion control , slow start, fast 
retransmit , and nagle . 

Computer systems implementing these techniques may 
realize one or more of the following advantages. First, 
the techniques can result in faster SAN 14 operating speeds 
as a result of the elimination of system bottlenecking 
effects caused by extensive protocol processing overhead at 
the application nodes 20a, 20b, 20c ... 20k. Second, 
decoupling TCP/IP processing from the application nodes 
20a, 20b, 20c ... 20k to the proxy nodes 18a ... 18b can allow 
independent scaling of system TCP/IP processing 
capabilities and application processing capabilities. 
Third, since these techniques are typically not constrained 
by legacy API (sockets) , operating system environment, or 
hardware platform, systems may be optimized to meet both 
TCP/IP and lightweight protocol processing demands. 
Fourth, other value-added services, such as, load 
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balancing, caching, firewall, content transformation, and 
security protocol processing can be built on top of these 
decoupling techniques. SANs 14 incorporating the 
techniques described above can incorporate two levels of 
load balancing. At one level, the network nodes 16a ... 16k 
can perform session level load balancing on a group of 
proxy nodes 18a ... 18k using network address translation 
techniques or Internet protocol tunneling techniques. At a 
second level, each proxy node 18a ... 18k can perform 
application level load balancing on a group of application 
nodes 20a, 20b, 20c ... 20k. 

Furthermore, systems using the techniques described 
above can provide increased flexibility. In addition, 
failures incurred in TCP/IP processing may be treated 
independently from application failures, thus providing 
improved SAN 14 reliability. Additionally, resource 
contention between application processing demands and 
TCP/IP processing demands can be significantly reduced. 

Various features of the system may be implemented with 
hardware, software or with a combination of hardware and 
software. For example, some aspects of the system can be 
implemented in computer programs executing on programmable 
computers. Each program can be implemented in a high level 
procedural or object-oriented programming language to 
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communicate with a computer system. Furthermore, each such 
computer program can be stored on a storage medium, such as 
read-only memory (ROM) readable by a general or special 
purpose programmable computer, for configuring and 
operating the computer when the storage medium is read by 
the computer to perform the functions described above. 

Other implementations are within the scope of the 
following claims. 
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